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ABSTRACT 

Double-Fourier interferometry is the most viable path to sub-arcsecond spatial resolution for future 
astronomical instruments that will observe the universe at far-infrared wavelengths. The double 
transform spatio-spectral interferometry couples pupil plane beam combination with detector arrays 
to enable imaging spectroscopy of wide fields, that will be key to accomplishing top-level science goals. 
The wide field of view and the necessity for these instruments to fly above the opaque atmosphere 
create unique characteristics and requirements compared to instruments on ground-based telescopes. 
In this paper, we discuss some characteristics of single-baseline spatio-spectral interferometers. We 
investigate the impact of intensity and optical path difference noise on the interferogram and the 
spectral signal-to-noise ratio. We apply our findings to the special case of the Balloon Experimental 
Twin Telescope for Infrared Interferometry (BETTII), a balloon payload that will be a first application 
of this technique at far-infrared wavelengths on a flying platform. 

Subject headings: Interferometry, double Fourier, balloon, phase noise, interferogram 


1. INTRODUCTION 

Observations at mid- to far-infrared wavelengths 
from the Earth’s surface are extremely limited by the 
large atmospheric opacity in this region of the spec¬ 
trum. Space-based te lescopes like IRAS (12-100 nm; 
Neugebauer et al.lll984H . IS O 12.5-240 nmdKessler et all 
1996ft . Svitzer (3.6-160 /iimlWerner et al.ll2004ft . AKARI 
('1.7- 180 urn: IMurakami et all 1200111 WISE (3.4-22 
am; IWright et al.lJ2010T) and Herschel (55-672 fin i; 
iPilbratt et all 12010 1 have demonstrated the scientific 
value of observations at these wavelengths; but the 
spatial resolution of space-based observatories is lim¬ 
ited by the cost and complexity of building and fly¬ 
ing progressively larger aperture telescopes. Inter¬ 
ferometry is a common solution to this problem on 
the ground, and is a viable path forward to obtain 
much higher resolution than what single apertures can 
provide. In particular, spa tio-spectral interferometry 
IlMariotti and Ridgwavi Il988h is a way to achieve high 
angular and spectral resolutions at far-IR wavelengths 
from above the atmosphere, without the cost and limi¬ 
tations of large single apertures. 

Several space-based interferome ter concepts, the 
Far I nfrared Interferometer (FIRI; iHelmich and Ivisonl 
I2009T) . t he Space Infra-red I nterferometer Telescope 
(SPIRIT: ILeisawitz et al.l 1200711 . and the Submillimeter 
Probe of the Evo lution of Cosmic Structure (SPECS; 
lHarwit et al.ll2006ft . have been proposed and use spatio- 
spectral interferometry to achieve the much needed an¬ 
gular resolution to study astronomical processes such as 
the birth of stars and planetary systems, the activity in 
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galactic nuclei and the formation of galaxies in the dis¬ 
tant universe. The FIRI and SPIRIT concepts have two 
mirrors which are movable on one axis along a monolithic 
truss to provide a range of baseline lengths. SPECS con¬ 
sists of three spacecraft connected via tether to achieve 
baselines of order 1 km. 

There are numerous engineering challenges to be ad¬ 
dressed before such missions can become reality. A 
number of them ca n be tackled with te stbeds (e.g. 
ILeisawitz et al.l 120121 : iGrainger et al.l 12012ft and small- 
scale pathfinder missions. These missions will likely be 
two-element, single baseline interferometers in space or 
on balloon platforms, such as the Balloon Experimental 
Twin Telescopes for Infrared Interferometry (BETTII; 
iRlnehart et all 120141 ) and to a certain extent the Far- 
Infrared Interfer ometric Telescope Experiment (FITE; 
iKato et al.ll20l(ift . These pathfinders will have very lim¬ 
ited baseline coverage and rather than producing full im¬ 
ages, they will focus on reconstructing spectral informa¬ 
tion from closely-spaced sources. This paper explores 
aspects of the noise in spectral measurements specific to 
these instruments. 

1.1. Spatio-spectral interferometry 

In their pioneering paper, iMariotti and Ridgwavi 
(11988ft lay out the principles of spatio-spectral (or double- 
Fourier) interferometry. A spatio-spectral interferome¬ 
ter consists of a Fourier transform spectrometer (FTS), 
where a delay line mechanism modulates the optical path 
difference (OPD) between two independent light beams 
before combining them in the pupil plane. The instru¬ 
ment produces interferograms, which are arrays of power 
measurements as a function of the OPD. Unlike tradi- 
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tional FTS, where a single incoming beam is split, delay- 
modulated, and recombined, a double-Fourier interfer¬ 
ometer utilizes multiple light collectors pointing to the 
same astronomical source and combines the incoming 
light from the collectors pairwise in the pupil plane. The 
orientation and magnitude of the baselines - the vectors 
between each pair of light collectors - determines which 
spatial frequency of the astronomical image the instru¬ 
ment measures. Longer baselines correspond to higher 
angular resolutions. The “double-Fourier” aspect comes 
from the fact that the interferogram measured on a given 
baseline is related to the Fourier Transform (FT) of the 
spatial and spectral distribution of the source emission. 
Two FTs are used to reconstruct the full spatio-spectral 
datacube representing the astronomical scene: the spec¬ 
tra which are more directly related to the power as a 
function of time delay difference between the two incom¬ 
ing beams (equivalent to the OPD) and the source 2D 
spatial structure on the sky which is more directly re¬ 
lated to measurements accumulated from many different 
baseline vectors. The length of the baseline vectors can 
be changed by modifying the distance between the light 
collectors. The orientation of the vectors can be changed 
by rotating the baseline with respect to the source on the 
sky. The plane representing the source’s interferometric 
visibilities as a function of baseline vector is referred to as 
the ( u , i>)-plane and is a common notion in ground-based 
submillimeter and radio interferometry. This paper fo¬ 
cuses on the reconstruction of the spectrum from closely- 
spaced point sources using single-baseline measurements, 
and does not address the techniques and sensitivities in¬ 
volved in using multiple baseline lengths to produce an 
image of the scene; a mathemat ical formalism tha t covers 
imaging is already proposed in lElias et al. (2007). 

Proposed double-Fourier instruments at far-IR wave¬ 
lengths distinguish themselves from operating interfer¬ 
ometers at sub-millimeter and radio wavelengths in sev¬ 
eral ways. First, they do not directly measure the phase 
information. The fundamental measurement is a time 
series of real-valued power as a delay line modulates 
the OPD in a controlled sequence (for example a linear 
ramp). The OPD from the delay line, as well as other 
OPD contributors in each arm of the instrument, and the 
external OPD created when the line of sight to a source 
is not perpendicular to the baseline vector, add up to the 
total OPD. For a given detector location along the pro¬ 
jected baseline vector, there exists a value of the OPD in 
the delay line that exactly compensates all other OPD 
contributors. This delay line position results in a zero net 
total OPD, and is called the Zero Path Difference (ZPD). 
At this value of OPD, an incoming plane wave traverses 
the two beam paths reaching the detector exactly with 
the same phase, for all wavelengths. ZPD corresponds to 
the center of an interferogram for a source at that detec¬ 
tor location. In the context of this paper, the phase for a 
given wavelength cf>\ is related to the OPD between the 
beams from each arm when they combine, at the time of 
a data point measurement: (j>\ = 27rOPD/A. 

A second important difference for balloon and space in¬ 
terferometers is that collectors are not fixed to the Earth. 
In the case of BETTII and SPIRIT, the collectors are 
fixed to a truss structure which is part of the mechan¬ 
ical system for pointing the collectors. Consequently, 
baseline length and external OPD, as relevant to an as¬ 


tronomical source, are not independent of pointing er¬ 
rors. The impact of errors in baseline length is modest 
because the relevant measure is in terms of fractions of 
the collector diameter. Errors in pointing translate into 
external OPD as the sine of the error angle times the 
baseline length, while the relevant measure is the wave¬ 
length. This can easily become significant; for example, 
a 1” pointing error for an 8 m long baseline corresponds 
to a 38 pm shift in OPD. 

Third, bolometer-type detectors, such as being built 
for BETTII and envisioned for SPIRIT, are easily, and 
indeed typically, configured as two-dimensional arrays. 
With pupil plane combination, the entire field of view has 
an interferometric response; hence wide-held interferom¬ 
etry over multi-pixel arrays is straightforward. Fig. [l] 
shows this concept and sketches the instrumental re¬ 
sponse. For the configuration shown with the detector 
array columns aligned perpendicular to the baseline vec¬ 
tor, ZPD is the same along lines perpendicular to the 
baseline vector projected on the detector. As the OPD 
is swept, it moves across ZPD for the different columns 
in the array, yielding interferograms with shifted centers 
corresponding to the changes in external OPD for each 
source location in the held. 

By sweeping the OPD, the double-Fourier instrument 
measures interferograms which contain both spectral and 
spatial information over the detector array. The full 
spatial and spectral source information can be unam¬ 
biguously recovered by repeating the delay line sweep 
over a range of baseline angles and lengths, which 
correspond to different spat ial frequencies on the sky 
dMariotti and RidgwavllT988l l. 

1.2. The case study: BETTII 

The BETTII project dRinehart et al.ir2014T ). is a mo¬ 
tivation for this paper and a near-term application of 
spatio-spectral interferometry. BETTII consists of two 
50 cm siderostats on a hxed 8 m baseline, with a far-IR 
beam-combining instrument at the center. It will observe 
the far-IR universe in two wavelength bands, 30-55 pm 
(“band 1”) and 60-90 pm (“band 2”). The instrument 
is currently under construction at NASA Goddard Space 
Flight Center and is scheduled to launch in the Fall of 
2016 on a stratospheric balloon from Fort Sumner, New 
Mexico, to an altitude of 35 km in order to be above most 
of the atmosphere. For its hrst flight, BETTII will focus 
on the study of dense star formation in nearby clusters. 
While a complete image reconstruction is not possible 
due to the static baseline length, BETTII will help re¬ 
solve point source objects that are 0.5-1” apart in band 1 
and 2, respectively, more than ten times the spatial res¬ 
olution of Spitzer at 24 pm and six times the resolution 
of SOFIA at 37 pm. Combined with a modest spectral 
resolution of 1Z = 10 — 50, BETTII will measure the 
spectral energy distributions (SEDs) of clustered young 
stars to determine their evolutionary stage, locate the 
origin of the far-IR emission, and improve our under¬ 
standing of how stars accrete their mass in thes e very 
dense regions of stellar birth (e.g. see iTan et all 12014 
and references therein). For resolved sources, the fixed 
baseline will not completely lift degeneracies between the 
spectral and spatial information; however detailed source 
modeling can put c onstraints on the dis tribution of the 
far-IR emission (e.g lWhitnev et al~ll20 1 .'ll ). 
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In this paper, we study how various types of noise prop¬ 
agate to the derived spectrum in an instrument like BET- 
TII or SPIRIT. In section[2] we establish a mathematical 
formalism that can be used to represent interferograms. 
In section 3, we look at the dominant types of noise in 
the interferogram and define the relevant timescales asso¬ 
ciated with spatio-spectral interferometers. In section 4, 
we derive the spectral signal-to-noise ratio (SNR). In 
section 5, we apply these results to the special case of 
BETTII to derive its point source spectral sensitivity. 

2. MATHEMATICAL FORMALISM 

The general optics layout for a double-Fourier system 
is shown in Fig. [2] for a single baseline. The combination 
of the siderostat and beam compressor acts as an afocal 
telescope which outputs a parallel beam with a diam¬ 
eter convenient for the rest of the optical train. The 
K-mirror in one beam path corrects for the pupil rota¬ 
tion so that the images of the sky from the two collectors 
are matched over the field of view. At the center of the 
instrument, there are optics for pupil re-imaging, filter¬ 
ing, and beam folding, as required by the specific imple¬ 
mentation. The key components for our purpose are the 
delay line, beam combiner and detectors. The delay line 
introduces a controlled OPD between both arms. The 
two incoming beams are combined in the outputs from 
the beam combiner. We arbitrarily define one output 
as the “+” and the other as the To conserve pho¬ 
ton energy, the two outputs must be complimentary such 
that the summed power of the two is independent of the 
OPD. In an ideal double-Fourier system, the two beam 
paths are symmetric about ZPD; hence, the power from 
the “+” and outputs are equal at ZPD, and have 
odd symmetry about ZPD. In a traditional FTS at ZPD, 
one output has fully constructive interference while the 
other has fully destructive interference, with even sym¬ 
metry about ZPD. 

2.1. Interferograms for a two-aperture instrument 

The interferogram for a single frequency of light mea¬ 
sured at the outputs of the ideal, two-element double- 
Fourier instrument can be described in terms of the nor¬ 
malized intensity: 

I±(x,a) = Re{l±iV B (a)e~ 2inax ), (1) 

where a = j is the wavenumber of the light in cm -1 as 
per the convention for the FTS literature, x is the in¬ 
strumental OPD created by the delay line with x = 0 
corresponding to ZPD, and Vb(ct) is the complex spatial 
visibility of the astronomical source for the baseline vec¬ 
tor B. “Re(/)” indicates the real part of the complex¬ 
valued function /. The ± indicates values for the two 
output beams: “+” and in Fig. [2j The derivation of 
this expression is given in Appendix [A] 

The normalized complex spatial visibility Vb has a 
magnitude of 1 for all baselines for which the source is 
completely unresolved. For extended sources, the spa¬ 
tial visibility depends on the source geometry, intensity 
distribution, and the i nstrument base line vector as de¬ 
scribed in Chapt e r 2 oflLawso 3 (|2000f) and Chapter 3 of 
iThompson et al.l (1200811 . For a normalized source bright¬ 
ness distribution T, the spatial visibility with respect to 


a phase reference position on the sky can be written as: 

Vb (<t) = f dO4(0.F(0e- W ' B , (2) 

J source 

where A is the normalized reception pattern of the col¬ 
lecting area; B is the baseline vector between the two 
collectors and £ is the vector on the plane of the sky from 
the phase reference position to the infinitesimal solid an¬ 
gle dll. The resulting visibility as a function of baseline 
vector is the 2-dimensional FT of the source’s sky dis¬ 
tribution. Since J- does not have to be symmetric with 
respect to the chosen phase center, Vb is in general com¬ 
plex and can be expressed as an amplitude and a phase, 
$b(ct): Vb(ct) = |V B (a)|e i$B ( CT ). 

Real instruments have asymmetries, imperfections, 
and measurement errors which can create phase-shifts 
between the two optical paths and across the pupils. 
Fixed instrumental effects can be represented by a nor¬ 
malized instrumental visibility loss term, Vfa) where the 
complex quantity Vi(cr) = |Vi(g )| e I$ V^ , as described in 
details in Chapter 3 of iLawsonl (12000D . represents both 
amplitude losses and phase shifts (see Appendix lAl). Ad¬ 
ditional phase errors can arise from imperfect knowledge 
of the real-time optical path lengths which we will rep¬ 
resent as e ® < M <T > x ) j where Q r (a,x) is the “phase noise”; 
this term depends on the OPD x through time-dependent 
phenomena such as mechanical jitters, temperature vari¬ 
ations in the optics support, or pointing errors. In the 
rest of this paper, we will mostly talk about this “OPD 
noise”, which is the physical source of the noise, whereas 
phase noise represents its effects on the interferogram. 
The total complex visibility sampled at a single <j by the 
system is VB(cr)Vi(cr)e* <I>r (' T ’ x ^ and it is normalized such 
that, for an ideal instrument observing a point source, 
this quantity is equal to 1 at ZPD. 

Using Eq. |T] for the monochromatic source, the poly¬ 
chromatic interferogram is the integral over a of this 
dimensionless response at each wavenumber. The total 
amount of power coming into the 2-aperture interferom¬ 
eter within a small wavenumber range da is 2AB{a)cda 
where 2A is the total aperture area in m 2 , B(a) is 
the spectral flux density in W-m _2 -Hz _1 and c is the 
speed of light in cm-s _1 . Filters and optics in an instru¬ 
ment cause a wavenumber-dependent transmission pro¬ 
file 7b p (er). The quantum efficiency of the detector can 
depend on wavenumber, r]D(a). For multi-pixel detec¬ 
tors the interferogram is measured by matched filtering 
a point-spread function on a pixel array, which has some 
efficiency r) m f. 

The actual power measured by the instrument can be 
represented as: 

r+oo 

I±{x) = Ac r] m {-q D Tb v Bx 
Jo 

Re [(1 ± iV i V B e i<I>r e -2 * 7rcra )] da, (3) 

where the factor of 2 for the two apertures is dropped 
because it is implicit in Eq. [L] All quantities within the 
integral can be functions of wavenumber. All the static 
interferometric loss terms and delay-dependent phase er¬ 
rors are in V; and e l4>r , respectively. 

Instead of considering each separate output, we use / = 
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as our interferogram expression, which cancels out 
the constant term. We also introduce an interferometric 
instrument transmission function, which can be complex, 
which represents the normalized amplitude and phase of 
the interferogram for a point source of uniform spectrum 
and no phase noise: 

T inst (a) = AauenoTbpVi = \T inst (a)\e ^^, (4) 

We can then write the modulated signal as: 

I{x) = Re ^2 £ i\T inst \BV^ +i ^e-^ x da^ , 

(5) 

where B is real and V B can be complex. 

Eq. [5] can be turned into a Fourier transform 
by mirroring all quantities to negative w avenumbers. 
This convention is explained in detail in iDavis et al.l 
(2001ft for FTS instruments; the odd symmetry of 
the interferogram for a system with one beam com¬ 
biner and the complex instrumental transfer function 
means that the incident spectrum on the detectors 
must be mirrored to -a as the negative of the com¬ 
plex conjugate of +a: S e (a) = [T ins tBV B ] e (<T) = 

I lTin S t(<r)B(a)V B (a) - T* nst (-a)B(-a)V£(-a)}. We 
use the subscript “e” to denote the reflected function, 
and will apply this convention in the rest of this paper; 
this reflection ensures that the integrals keep the same 
value when are expressed from — oo to +oo, and does not 
affect the SNR estimates: although the signal appears to 
be divided by a factor of two, so is the noise, as it is 
spread between positive and negative frequencies. The 
interferogram expression is then: 

/( x) = Re £ + iS e e~ 2inax+i ^ r da'j . (6) 

For the ideal case where 4v = 0, this expression shows 
that the interferogram is actually the real part of the 
Fourier transform of the spectrum at the detector, iS e . 

2.2. Measured interferograms 

In practice, the interferogram data are discrete mea¬ 
surements of a real-valued signal on the detectors. Like 
for most FTS instruments, each data point on the in¬ 
terferogram corresponds to an integration of the detec¬ 
tor while the delay line is continually in motion. This 
decreases the amplitude of the interferogram due to 
the local smearing of the fringes, but it can be kept 
to low values by increasing the fringe sampling. At 
each OPD x n , the interferogram has a measured value 
T(x n ) = £-+£/’ I{x)dx. To first order, this has the 

effect of multiplying the power at each wavenumber by 
sinc(7rcrdx). For the purpose of this paper, we consider 
this term to be included as part of the instrumental trans¬ 
mission Tinst- Note that the value of the optical delay x n 
is the path difference from ZPD, not the physical location 
of the delay line, since there could be a multiplying factor 
between the two due to beam folding (e.g., for BETTII, a 
motion of 1 mm of the delay line creates 4 mm of OPD). 

A discrete Fourier transform (DFT) is used to trans¬ 
form a discrete interferogram of N measurements into a 
complex discrete spectrum with N points. The resolving 
power of the instrument, 1Z = X/dX , is dependent on the 


physical length scanned by the delay line L: 1Z = La /2 
for a scan with symmetric length on both sides of ZPD. 
For these instruments where we scan through the whole 
interferogram, the data should be sampled at least at the 
Nyquist rate for the interferogram response frequency of 
dx = A/2. For a sampling exactly equal to Nyquist, we 
have the relationship: N = 41Z. 

For a double-Fourier instrument, as shown in Fig. [l] 
the ZPD for different columns on the array occurs at 
different delay positions cc co i, related to the projected 
baseline length. The simplest way to express this is in 
terms of the angular offset on the sky of each column, £, 
along the direction of the baseline, B: 

= |B| = |B|e = 48.7 M m (j^L) (j^) . 

(?) 

where we have filled in practical units for an infrared in¬ 
strument. For a far-IR interferometer working at 50 /am, 
with 1-2 m diameter collectors, the delay shift across the 
collector point spread function (collector angular reso¬ 
lution) is several to ten wavelengths. Hence the scan 
length to cover a wide-held array detector is comparable 
to the scan length required to achieve IZ’s of 100’s to 
1000’s. This property is an important consideration for 
observation and data analysis strategies. 

The ideal interferogram for a point source from a per¬ 
fect instrument is an odd function of the OPD x, so its 
DFT is purely imaginary. The noise in the interferogram 
will be converted into spectral noise in both the real and 
imaginary axes so the real axis is a proportional measure 
of the noise. Referring back to Eq.[6l phase shifts caused 
by the instrumental transfer function and source spatial 
visibility will break the anti-symmetry; in practice, the 
DFT of a measured interferogram is complex and the 
real and imaginary parts are of interest. The scientifi¬ 
cally interesting quantities are the source spectrum and 
source spatial visibility: B and V B ; the fixed instrumen¬ 
tal terms have to be calibrated or properly modeled by 
observing a bright point source of known spectrum. The 
techniques for calibrating F TS systems are well devel¬ 
oped fe.g. IDavis et al.ll2001lh and there are many meth¬ 
ods proposed to correct some phase and am plitude errors 
(e.g. iForman et al.lllOOfit lSromovskvll2003H . 

The phase noise term <h r (x, a) in Eq.0 and the SNR in 
the measured interferogram can have significant impact 
on the ability to recover the source spectrum with a real 
instrument. The upper panel in Fig. [3] shows an example 
of an interferogram (left), and the transformed S e (ak) 
(right) for a source with flat power spectrum, multiplied 
by a flat bandpass function with smoothed edges. The 
middle panel of Fig. [3] shows the same source and instru¬ 
ment parameters as the upper panel, now with an as¬ 
sumed Gaussian OPD noise of standard deviation equal 
to 10% of the central wavelength of the band Ao = ^ 
(i.e., there is a Ao/10 OPD uncertainty for each data 
point in the interferogram). The lower panel is the top 
panel observed with a incoherent background noise corre¬ 
sponding to SNR = 10 at the peak of the interferogram, 
and no OPD noise. The next sections of this paper will 
analyze these noise contributions and quantify their im¬ 
pact on the derived spectrum. 

3. NOISE SOURCES 
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The two primary types of noise in a double-Fourier 
instrument are intensity and OPD noise. The inten¬ 
sity noise consists of the astronomical and thermal back¬ 
ground noise, the photon noise from the source, and the 
detector noise. The OPD noise arises primarily from un¬ 
certainties and changes in OPD, which would prevent us 
from accurately knowing the x-values of measurements 
in the interferogram before the FT. For convenience, we 
usually refer to the OPD noise as a percentage of the car¬ 
rier wavelength. In the rest of this paper, a “10% OPD 
noise” signifies that the OPD for each measurement in 
the interferogram is known to within an error of 10% of 
the carrier wavelength, or 10% of one full fringe cycle. 


3.1. Intensity noise 

The measured signal has units of power and can be 
represented as the interferometric signal with additive 
noise: 

^measured(^n) = DiXn) T (8) 

with nx being the difference of the noise in the two out¬ 
puts of the interferometer, nx = n+ — n_. When the 
beam combiner, optical train, and detectors are sym¬ 
metric, the residual nx has zero mean. The total noise 
in /measured (x n ), expressed in Noise Equivalent Power, 
NEPtot, is the sum of the three noise variances: 

NEP t 2 ot = 2NEP 2 h + 2NEP 2 et + 2NEP s 2 ou , (9) 

where NEP p h and NEP sou are the noise from the back¬ 
ground (both astronomical and thermal) and source pho¬ 
ton noise, respectively, in one output, and NEP^et is 
the noise-equivalent power characterizing each detector’s 
noise (including phonon, readout and Johnson noise). 
The factor of 2 multiplies each term since we are con¬ 
sidering the difference of both outputs. The relation be¬ 
tween NEPtot an d the variance A|- of the noise nx during 
an interval dt is (lSromovskvll2003[) : 


2 _ NEP 2 ot 
‘ x 2dt 


( 10 ) 


For space instruments, the noise will likely be dominated 
by the sky background (zodiacal light, galactic cirrus 
emission, or optics thermal emission) and detector for a 
very large fraction of astronomical targets, which tend to 
be faint; for balloon instruments, emission from warm op¬ 
tics and the atmosphere sets the noise level in the far-IR. 
The detector is chosen to be a small fraction of the other 
noise factors in order to optimize its dynamic range. 


3.2. OPD noise 

Observing from the ground at optical wavelengths 
with a double-Fourier interferometer is limited by the 
phase coherence between the apertures, which is re¬ 
lat ed to the atmospheric coherence time, as discussed 
by iMariotti and Ridgwavl (I1988T ). The short coherence 
time forces fast scan rates, which degrades the sensi¬ 
tivity of the instrument due to short integration times 
and phase shifts between sequential scans. This is not a 
problem for flying platforms, since even at balloon alti- 
tudes the atmosph eric coherence is not a significant issue 
(iRizzo et al.ll2012lf . The major concerns for balloon and 
space missions are overall instrumental stability, knowl¬ 
edge of ZPD, and pointing errors, which can all con¬ 
tribute to OPD noise. 


OPD noise arises in an interferogram when the OPD at 
the time of a measurement is uncertain, hence compro¬ 
mising the reconstruction of the true x- value. Since this 
uncertainty is a physical delay 5 X , the error in phase is 
wavenumber dependent: 2Tr5 x a. 5 X is the difference be¬ 
tween the estimated x and the true x. For single-beam 
FTS instruments, internal laser metrology can provide 
optical path leng th measurem ents to high accuracy (e.g. 
iGriffiths and De Hasethl 12007D . and the separate paths 
the split beams need to travel can be kept small. For 
double-Fourier instruments, the entire optical paths up¬ 
stream of the beam combiner affect the OPD, hence it 
is more challenging to accurately measure and estimate 
the OPD contributors. In addition, common-mode point¬ 
ing errors of the collectors are geometrically converted to 
OPD errors. Hence, it is critical to know the position and 
orientation of the baseline vector with respect to the as¬ 
tronomical target with high accuracy in order to properly 
reconstruct the interferogram. 

For this analysis, we identify three timescales that can 
be used to examine the effects of OPD noise on the 
interferogram. These timescales are important to con¬ 
sider in the design of the OPD control system of any 
double-Fourier interferometer. Timescale 1 is the short¬ 
est and corresponds to the integration time for a single 
data point, typically a few milliseconds. In practice, this 
kind of OPD noise could be created by high-frequency 
mechanical jitters in the instrument (including the de¬ 
lay line bearing and motor, stiction behaviors and res¬ 
onant modes, reaction wheels and other self-induced vi¬ 
brations...). Timescale 2 is the time it takes to acquire 
one single interferogram over the full field of view and at 
the desired resolving power, typically on the order of sec¬ 
onds. The sources of noise that can affect this timescale 
include for example pointing errors and drifts, as well 
errors in the knowledge of the delay line position rela¬ 
tive to a reference ZPD. Finally, the longest timescale to 
be considered, timescale 3, is the time it takes to com¬ 
plete one full “track” by co-adding several consecutive 
interferograms to achieve the desired SNR, typically a 
few minutes long. During this timescale, it is expected 
that the change in baseline orientation on the sky does 
not produce any significant change in the source spatial 
visibility function. The latter timescale is most impor¬ 
tantly influenced by thermal variations and time-varying 
gradients that could change the optical alignment and 
mechanical configuration between the two arms. 


4. SPECTRAL SIGNAL-TO-NOISE RATIO 
4.1. Effects of Gaussian intensity noise 

In the presence of Gaussian intensity noise (thermal 
background and detector noise), the measured interfer¬ 
ogram is of the form of Eq. [8] We suppose that the 
noise has a variance A|- and zero mean, and is indepen¬ 
dent of delay position. In particular, this assumes that 
the source photon noise is negligible. The noise in the 
spectral domain is the transform of the noise in the in¬ 
terferogram domain: 


N / 2 — 1 

dxDFT{n x ) = dx J2 rii(x n )e 2innk/N , (11) 

n =— N /2 
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where the dx factor is to no rmalize the noise to a sam¬ 
pling bin dPress et al.lll992f) . n indexes the N discrete 
measurements in the interferogram, and k indexes the N 
discrete wavenumbers in the spectral domain. We keep 
this notation throughout the paper. The interferogram 
interval is symmetric with about ZPD (n=0). The noise 
variance is equal in the imaginary and the real domain, 
and can be expressed as the variance of the noise trans¬ 
form: 

A| = da: 2 VAR (Re(DFT(m))) , (12) 

where VAR is the variance operation. By writing out 
the variance we obtain: 

N [ 2 ~ 1 AT 

A| = dx 2 Aj ^2 cos 2 (27t nk / N) = —dx 2 A|, (13) 

n=—N/2 

where we used J2n=-N/2 cos 2 (27rnA:/A) = N/2 for 
k ^ 0 . 

The signal at wavenumuber at in the discrete spectrum 
S e (<Jk) observed at the detectors is: 

i /‘O’k-^Sa/2 

S e {<Jk) = t— / S e (a)da, (14) 

6(7 Ja k -Sa/2 

where 5a = (Ndx)~ 1 . A spectral line of power P e at ak 0 
will thus have an apparent flux density S e (< 7 fc) = NdxP e 
at k = fco and 0 for all other k. The signal-to-noise ratio 
in the spectrum can be expressed in general as: 


SNRfc 


Se(v k ) 
A 5 


/~ 2 ~ S e (a k ) 
V N dx Ax ' 


(15) 


Defining the central wavenumber of the band as a 0 , 
the spectral resolving power of the transformed interfer¬ 
ogram is 72. = dxNa 0 /2. We introduce the sampling pa¬ 
rameter s = ( aodx ) _1 which is the number of data sam¬ 
ples per fringe for the central wavenumber in the band. 
The spectral resolving power at the band center can now 
be written 72 = -!p. In practice one wants to pick a value 
of s that ensures Nyquist sampling on the fringe for all 
wavenumbers in the band so s ~ 3 or greater is typically 
preferred. For a given integration time per data point 
(given SNRx), increasing the fringe sampling effectively 
increases the amount of time spent on the fringe, so the 
spectral SNR should increase with yfs. Note that as long 
as we Nyquist-sample the fringe, there is no difference 
between multiplying the fringe sampling by some factor, 
and increasing the integration time per data point by the 
same factor, since in both cases the effective time on the 
fringe is equally increased. 

For continuum and low-spectral resolution measure¬ 
ments, it is useful to relate SNR*, to the SNR in the 
interferogram at the location of maximum intensity of 
the fringe, using physical quantities. The noise in each 
discrete measurement of the interferogram is Ax. The 
signal at maximum intensity is X max = daS, where da is 
the width of the bandpass filter and S is the average value 
of the signal in the band. Defining SNRx = X ma x/Ax, 
and noting that y/Ndx 2 / 2 = A. ^R/s, we obtain: 


SNR*, 


S e V2 

y/Ndx Ax 


(16) 


Thus for a given SNRx, the SNR in a channel of the 
final spectrum depends inversely on the square root of 
the resolving power 72 and the fractional bandwidth ^; 
and it depends directly on the square root of the number 
of samples per fringe y/s. 

For a given integration time, SNRx is proportional to 



, which means that SNRfc is inversely proportional 



Hence, maximizing the SNR in the interfero¬ 


gram by increasing the bandwidth does not lead to a bet¬ 
ter spectral SNR. Although the central fringe has more 
SNR with a larger bandwidth, the fringe envelope is de¬ 
creasing more rapidly, and we see less fringes with good 
SNR. With a smaller bandwidth, the envelope is broader 
and although each fringe has less SNR, we are seeing 
more of them within the scan, so we spend more time in 
an interesting region of the interferogram. The number 
of fringes within the main lobe of a standard sine function 
is indeed inversely proportional to the fractional band¬ 
width. This is an important concept that can be used to 
tune the spectral resolving power appropriately: if the 
instrument has a large bandwidth, increasing the scan 
length (hence 72) well beyond the fringe pattern’s main 
lobe will hurt the sensitivity, since many data points will 
be adding noise and almost no signal. The dependencies 
discussed above can be summarized as: 


SNR 2 oc 



(17) 


4.2. Effects of Gaussian OPD noise 

This section derives analytic expressions for the effects 
of Gaussian-distributed OPD noise. We look at the gen¬ 
eral case in order to derive sensitivities for double-Fourier 
instruments. Here, we suppose that the OPD from the 
delay line, the OPD within each arm of the instrument, 
and the OPD caused by an off-axis source are all mea¬ 
sured or estimated with some residual error. Hence, the 
data points measured in the interferogram are associated 
with a delay value relative to ZPD, and if necessary, re¬ 
sampled to produce an evenly-spaced delay axis. This 
is necessary to use the FT and retrieve the spectrum. 
The noise on the delay estimate can be characterized as 
a wavenumber-dependent phase error in the interference 
on the two beams. In the following, we quantify the 
impact of this noise on the spectral SNR, in order to un¬ 
derstand how good our knowledge of the OPD needs to 
be to make sure the OPD noise effects are not dominant. 

Let’s consider a single frequency signal first, so that 
the phase is proportional to the OPD. If we suppose 
that these residual phase errors < h r .(x) are represented 
by a Gaussian distribution with zero mean and variance 
A|,, then the primary effect of the noise is to change the 
instantaneous power in /( x) by the factor e* 4 ’’ - W. Now 
we consider a large ensemble of realizations of this noise 
distribution in order to predict its effect on the SNR. Us¬ 
ing the expression from iRichard d (1200311 . for sufficiently 
small phase errors (< 7r radians), the intensity of the co¬ 
herent signal is reduced, on average, by a factor e~ A */ 2 . 
For Gaussian-distributed OPD uncertainties with stan¬ 
dard deviation A/20, where A is the wavelength, the sig¬ 
nal intensity is reduced by 5%; for A/10 the amplitude is 
reduced by 18%. To give a practical example of the im- 
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pact of this effect, we can consider the case of BETTII: 
if we assume that the uncertainty in the attitude of the 
payload is the only source of OPD noise, then knowing 
the attitude to within 0 . 1 ” rms will reduce the signal, on 
average, by 18% at 40 ^m. 

For the polychromatic case, the delay position uncer¬ 
tainty, S x , creates larger phase errors the shorter the 
wavelength, <f> r (fc) = 2ir8 x ak- A given OPD error dis¬ 
tribution of variance Aq PD yields a degradation across 
the band, e - A *W/ 2 , with A|(fc) = (2 tt) 2 A 2 OPD a 2 k . 

Of course, the power lost from the coherent fringe pat¬ 
tern is still present in the scan; it becomes part of the 
incoherent signal seen by each output. In the limit where 
there is no spectral noise from the background or detec¬ 
tors, defining Sk = S e (ak ) we have: 

5,. e -Aj(fc)/ 2 

= ( 18 ) 

where k' designates an index on all positive wavenumber 
bins. Note that N = 2s72. Thi s rela tionship is identical 
to the one derived bv lMevnartl (|1992f ). and we suggest an 
alternate and more detailed justification for it (see Ap¬ 
pendix [Bj . Studying this relationship, all the wavenum¬ 
bers contribute to the white noise at a given wavenumber 
(jfc. The strongest lines (strongest S 2 ,) and the shortest 

wavelengths (strongest 1 — e~ A *( k ■*) contribute the most 
to the overall noise. To summarize, considering an en¬ 
semble average of interferograms, OPD noise degrades 
the spectral SNR in two ways: first, it reduces the over¬ 
all signal in the interferogram; second, it converts this 
lost power into white noise. 

More realistically, observations will have both intensity 
and OPD-generated spectral noise. In this case, the in¬ 
tensity noise and the scattered power add in quadrature 
to give: 

dfcE*- [Sl,(l-e-^’))\+sKdx^l 

(19) 

The numerator of Eq. [TO] shows that any amount of 
OPD noise will reduce the spectral SNR. However, the 
impact of OPD noise is even greater when the power 
lost from the fringe is comparable to the intensity noise, 
as the first term of the denominator starts to matter. 
In fact, for arbitrarily large source fluxes, this equation 
reaches an asymptotic value which depends only on the 
OPD noise, and sets the maximum SNR achievable on 
average in a single scan. This is relevant for astronomi¬ 
cal calibrators which can be so bright that the intensity 
noise term is negligible. In that case, assuming constant 
OPD noise, more SNR is only achievable by co-adding 
consecutive scans, as we discuss in the next section and in 
Appendix C. For most astronomical applications, where 
targets are usually faint compared to the intensity noise, 
it is expected that the first term of the denominator will 
be negligible. 

4 . 3 . Co-adding interferograms 

Eq.[TO]is the general case of a single interferogram with 
OPD and intensity noise. In practice, we would co-add 
M interferograms in one “track” to build up SNR, but 


this puts stringent requirements on the performance of 
the control system and OPD estimator, because consec¬ 
utive interferograms need to stay aligned with each other 
to within a small fraction of the carrier wavelength, to 
avoid causing OPD noise. The design and performance 
of the OPD estimator is highly implementation-specific, 
but most balloon and space designs will likely include an 
estimator that either directly measures the OPD, or indi¬ 
rectly infers it from the measurement of another quantity. 

A direct OPD measurement can be achieved for exam¬ 
ple with a fringe-tracking instrument, while an indirect 
OPD estimate can be an attitude measurement, which 
can be related to the OPD by simple geometry by using 
some assumptions. The latter scheme only works if the 
OPD errors are only influenced by pointing uncertainties 
over the timescale of a track, and that all other OPD con¬ 
tributors are modeled and corrected with comparatively 
high fidelity. The spectral SNR over M scans can be de¬ 
termined from Eq. [TO] by multiplying the whole equation 
by a factor of y/M. The OPD noise term causing the 
phase noise variance A|, then corresponds to the vari¬ 
ance of the OPD uncertainties for each point of a scan, 
plus the variance of the OPD estimation error in deter¬ 
mining the position of the center of each scan, which is 
necessary to properly co-align them (Appendix C). 

4 . 4 . Implications for spectroscopy 

A primary application for BETTII and proposed mis¬ 
sions like SPIRIT will be the measurement of the spec¬ 
tral energy distribution from warm dust associated with 
star formation in different environments. These types 
of measurements require broad wavelength coverage but 
not especially high spectral resolution since the emission 
can be characterized as a sum of Planck functions over a 
range of temperatures. For an instrument like BETTII, 
covering from 30-55 fiin and 60-90 /im simultaneously, 
72 ~ 10 in each band is sufficient to accomplish much of 
the science. 

Spectral measurement with 72 ~ 10 requires covering 
a delay range of ±10 Ao for a single source. On the other 
hand, a delay range of 35-70 Ao (see Eq. ED is needed to 
move ZPD across 1 arc-minute of sky. Hence, typically, 
the delay requirements for spatial coverage creates inter¬ 
ferograms with higher resolution than needed to measure 
the continuum, and the full scan needs to be cut into 
smaller arrays around each target in the field. The size 
of these smaller arrays depends on the desired spectral 
resolving power 72, and the required sensitivity, as shown 
in Eq. [TO] However, the additional data can be used for 
higher-resolution spectroscopy, for example to measure 
specific atomic lines in the far-IR. The SNR for lines is 
actually increasing with the square root of the number of 
data points in the interferogram, as the broadband noise 
gets more diluted in increasingly narrower spectral bins 
(see Eq. [TO] [TOD. 

As discussed for FTS instruments (e.g. iDavis et~aD 
1200 11 1 , apodization, the weighting of the points of the 
measured interferogram before applying the DFT, is one 
method for optimizing the SNR in the spectrum. The 
weight scheme is optimized to measure a specific type of 
spectrum: narrow line, broad features, continuum. The 
method relies on the fact that the data points close to 
the center or edges of a fringe packet contain informa- 


SNR fc = 















tion about low or high spectral frequencies, respectively. 
For example, if the purpose of an observation is to study 
continuum, it is appropriate to apply smaller weights to 
data points far away from the central fringe, since they 
add noise and very little SNR. 

A common low-resolution spectroscopy case can be de¬ 
rived analytically if a source has a spectrum following a 
power law distribution over the covered band. We can 
write <S(ct) oc ct“ where the exponent a is the quantity of 
interest. Several methods have been developed to prop¬ 
erly fit these power l aws using maximum entropy and 
other techniques (e.g. IClauset et al.ll2007h . Here we use 
a simple estimator and provide a ready-to-use formula 
to help quantify the sensitivity of double-Fourier instru¬ 
ments. 

By taking the logarithm of the spectrum, the problem 
is turned into a weighted linear fit in log-log space, where 
we want to determine the slope of a line. The noise 

A 5 = Ag/S = 


d( ln(5)) 
dS 


in the new domain is Al = 

l/SNR^. The weights Wk = 1/A); of the linear fit are 
then simply the values of the spectral SNR squared at 
each data point, SNR 2 . The e rror on the we ig hted least 
squar e estimate of the slope is dBevington and Robinsonl 

Hxl): 


A 2 = 


E W k 


E w k E w k^k - (E w k x k ) 


2 ’ 


( 20 ) 


where Xk = ln(afc) is the natural logarithm of the 
wavenumber for data point k. In the case of uniform 
spectral signal-to-noise ratio SNR 5 over to points of the 
spectrum, this expression simplifies to: 


A 


2 

a 


1 

to x SNR| x VAR (X k )' 


( 21 ) 


This equation indicates that the variance of the spectral 
index estimate decreases with the number of points used 
to calculate the estimate, the spectral SNR squared, and 
the variance of the points distribution on the logarithmic 
wavenumber axis. For example, for 10 spectral points 
spread evenly from 30 to 55 /tm, each with a spectral 
SNR of 5, we obtain an error on the slope determination 
A q - 0.3. 


5. SPECTRAL SENSITIVITY ANALYSIS FOR BETTII 

This section applies elements of the above discussion to 
BETTII. A general discussion on the details of BETTII 
can be found in lRinehart et all (l2014f) . On BETTII, two 
mirrors collect light with an altitude-azimuth pointing 
system. The truss that holds the two mirrors moves in az¬ 
imuth and determines the baseline vector, while the mir¬ 
rors themselves move only in elevation. While BETTII 
does not physically rotate about the line of sight to cover 
different baseline angles, the payload always stays hori¬ 
zontal and the projection of its baseline vector changes 
as a source moves across the sky, hence covering differ¬ 
ent angles in the ( u , u)-plane. The absolute OPD and 
ZPD of the instrument cannot be measured, maintained, 
or known with perfect accuracy, especially during the 
flight itself, due to attitude estimation errors leading to 
our inability to perfectly estimate the orientation of the 
baseline vector in real time. In fact, a significant compo¬ 
nent of the mission’s design and implementation involves 


the selection and coordination of the suite of instruments 
which provide attitude measurements to construct the 
OPD estimator. 

A second relevant aspect of BETTII is that the detec- 
tors are cryogenic bolometers (see IStaguhn et all 12014 
for similar architectures) with 1 /f noise which sets an 
optimal read-out time for the detectors of around 2.5 
milliseconds (timescale 1). With BETTII’s designed field 
coverage of 2 areminutes, full field scans consist of 1024 
points and take 3 seconds to complete (timescale 2). Due 
to thermal emission from the atmosphere, warm mirrors, 
and cryostat windows, BETTII will be in the background 
noise limited case for all science targets. It is anticipated 
that 200 scans will typically be co-added to create one 
single visibility measurement over 10 minutes (timescale 
3). For most source locations, the variation of the base¬ 
line orientation due to change in parallactic angle is not 
significant over this period. 

5.1. Noise sources and control system 

Table Q] shows our estimates of the background power 
levels associated with the atmosphere, warm optics, and 
windows in the two BETTII bands. The detectors them¬ 
selves have been designed to have a noise level compara¬ 
ble to the background to optimize the use of the dynamic 
range of the devices. The total NEPs of the short and 
long bands are expected to be ~ 2 x 10 -15 W.Hz -0 5 and 
~ 1 x 10 -15 W.Hz -0 - 5 , respectively. The source photon 
noise is negligible compared to the total NEP. 

Balloon instruments are subject to low frequency 
(< 0.5 Hz) pendulum modes and other oscillations intro¬ 
duced by the system’s geometry and mass distribution, 
which make pointing a challenge. However, it is expected 
that the balloon environment is free of perturbations at 
any higher frequency (other than the instrument specific 
perturbations). Hence, sensors with high electrical band¬ 
width can robustly estimate the pendulum modes to gain 
accurate knowledge of the attitude, which can be used as 
our indirect OPD estimator since it is geometrically re¬ 
lated to the phase on sufficiently short timescales. 

The BETTII control system is organized with three 
different levels of control loops (iRizzo et all [2014 1: the 
coarse pointing loop, the fine pointing loop, and the OPD 
loop. The coarse pointing loop uses gyroscopes and star 
cameras to keep the baseline oriented within 10-15” of 
an appropriate near-IR guide star. A dichroic splits the 
near-IR (1-2 /mi) from the far-IR (30-90 /im) inside the 
cryostat before the scanning delay line. The guide star 
is imaged through each of the two arms on two separate 
readout windows of a near-IR detector array that shares 
most of the optical path with the science channels. The 
fine control loop uses fast-steering tip-tilt mirrors, lo¬ 
cated at the pupils of each arm, to control the guide star 
image on each window and maintain good overlap of the 
beams at the science detectors. This loop reads the near- 
IR detector and generates a tip/tilt correction at 100 Hz. 
We expect to achieve beam overlap to within better than 
1.5” at all times when a guide star is available. The spa¬ 
tial resolution of an individual BETTII beam is 17” in 
the short wavelength band so this is a little better than 
l/10th of a resolution element. The interferometric vis¬ 
ibility loss for this overlap error is anticipated to be less 
than 0.5%. 

We do not expect to be able to maintain the three di- 
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mensional orientation of the truss, and hence the baseline 
vector, to much better than 10” rms, due to the various 
pendulum modes mentioned above and large inertia of 
the payload. However, the errors in OPD introduced by 
pointing errors can be corrected directly using a delay 
line. BETTII uses a delay line external to the cryostat 
to correct the OPD at the entrance of the cryogenic vol¬ 
ume. This delay line is completely separate from the 
science delay line which scans the OPD to produce the 
interferogram. Two delay lines are not a requirement 
for a double-Fourier instrument in general as the job can 
be done in theory by a single mechanism, with sufficient 
range and mechanical bandwidth. The external delay 
line on BETTII allows for the possible future upgrade of 
correcting and monitoring the OPD outside of the cryo¬ 
stat usi ng the near- IR channel by implementing a fringe 
tracker (iR.izzo ct al.ll2012l ). 

For the OPD loop on BETTII, the angles of the tip/tilt 
mirrors which are used to maintain overlap of the beams 
act as an estimator of the baseline orientation, and hence 
as an indirect estimator of the OPD. The attitude esti¬ 
mates computed from these angles are converted to OPD 
and fed to the external delay line so that the OPD at 
the entrance of the cryostat stays as constant as possi¬ 
ble. Because the pendulation modes have periods of a 
few to tens of seconds and should be well-behaved, we 
expect to be able to trust the control signals and esti¬ 
mate the attitude of the baseline vector to ~ 0.12” rms, 
which corresponds to a fifth of a detector pixel in the 
near-IR tracking array. A 0.12” attitude error indirectly 
corresponds to a delay uncertainty of 5 pm, or 12% of 
a wavelength at 40 pm. This is a critical consideration 
when co-adding consecutive interferograms. With this 
amount of OPD noise we expect, on average, a ~ 25% 
degradation in SNR for all sources in the short band, 
simply from the effects of phase noise in reducing the 
coherent signal (see Eq. [TUI) . 

Even with a stable OPD estimator, the absolute ZPD 
of the instrument must be measured during flight and 
tracked over long timescales as the instrument and the 
truss cool down to ambient temperatures (~240 K). This 
can be accomplished by observing a bright point source 
with known position periodically during a flight and iden¬ 
tifying the center of the interferogram response (see Ap¬ 
pendix C). 

5.2. Derived sensitivity and faintest detectable targets 

Incorporating these sources of noise with the formu¬ 
las derived in the previous sections leads to the sensitiv¬ 
ity values shown in Table [2] In this table we show the 
sensitivity in the two bands. The minimum detectable 
flux density (MDFD), which is the flux that provides 
SNRx = 1 in a single interferogram, is 15 Jy and 26 Jy 
in band 1 and 2 respectively. For 200 scans averaged 
with a OPD noise between scans of 5 pm, the MDFD 
is 1 Jy and 2 Jy, usin g a matched fi lter efficiency of 0.5 
and 0.4, respectively (iMighelll 120051 ). The faintest de¬ 
tectable spectroscopic point source that leads to a spec¬ 
tral SNR = 5 is 25 Jy and 13 Jy, respectively. These are 
determined for “normal observing”, which consists of co¬ 
adding 200 scans in 10 minutes that span the whole 2’x2’ 
field of view, using a spectral resolution of 72 = 10 and a 
nominal OPD noise of 5 pm rms. 

At the bottom of the table, we also show the results 


in case we were using the instrument in an “enhanced 
sensitivity” mode. This mode is mentioned here to il¬ 
lustrate the flexibility of the interferometer and its ob¬ 
serving modes. It consists of increasing the individual 
integration time for each point in the interferogram by 
a factor of 3, while reducing the interferometric field of 
view by the same factor of 3: while the intrinsic field 
of view is unchanged at the detector, for the same scan 
time we only cover enough OPD range to cross ZPD for 
a subset of the pixels of the detector (and obtain a scan 
of the same length). This mode could be used for exam¬ 
ple for isolated targets which are located in less crowded 
star fields, by optimizing the time spent close to ZPD, 
where there is more signal (as we are interested in low- 
resolution spectroscopy). BETTII’s observing parame¬ 
ters can be changed during flight so that the instrument 
stays flexible to optimize the chance of seeing fringes. 

Finally, we show the overall sensitivity as a function 
of point source flux density (Eq. [TUI) for both observ¬ 
ing modes and both bands in Figure [4] In normal 
background-limited regime, the sensitivity curves should 
be straight lines. Here, OPD noise creates a decrease in 
overall sensitivity as a reduction in coherent power, but 
also, for brighter targets, from the power lost from the 
fringe that is converted to white noise (which causes a 
deviation from straight lines). For very bright targets of 
50 Jy or more, it is possible to measure the OPD accu¬ 
rately within each interferogram by tracking the fringes 
in the science channels themselves (see Appendix C). For 
sufficiently large SNR, this process has less error than 
the assumed 5 pm OPD noise coming from the indirect 
OPD estimation, so the OPD noise decreases for these 
very large fluxes to become negligible. This is particu¬ 
larly attractive for in-flight testing and calibration. 

It is important to note that for sufficiently faint targets, 
it is impossible to accurately measure the OPD using 
single scans or co-adds of scans: we rely on the OPD 
estimator to have sufficient stability to properly co-add 
scans until the next calibration measurement. This needs 
to be considered carefully when planning the observation 
strategy, as long stretches without calibration could lead 
to a total loss of the OPD information (hence a total loss 
in scientific data), due to other OPD noise contributors 
such as thermal drifts that impact the payload on long 
timescales. 

6. CONCLUSION 

Spatio-spectral interferometry can enable high resolu¬ 
tion spectral imaging of wide fields at far-IR wavelengths. 
Implementation of the technique provides some new in¬ 
strumental challenges compared to traditional Fourier 
Transform Spectroscopy, such as the fact that the mea¬ 
sured spectrum is a mix of the source’s spectral and spa¬ 
tial information. 

In a double-Fourier system, the zero path difference 
for each detector pixel occurs at a different delay setting 
of the delay line. The delay stroke needed to cover a 
scientifically interesting field of view is equivalent to a 
spectral resolving power of 100’s to 1000’s for the central 
pixels. 

We present an analysis of the impact of Gaussian inten¬ 
sity and OPD noise on the spectral sensitivity. Intensity 
noise, essentially thermal noise from the optics, sky, as- 
trophysical background, and detector, is similar to noise 
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in FTS systems with the exception that the longer scan 
lengths required to cover the spatial field add noise; this 
can be mitigated by cutting the interferogram for each 
pixel into smaller arrays centered on each source’s ZPD 
to match the desired spectral resolving power, and by 
apodizing the interferogram to increase sensitivity to the 
spectral properties of interest. We show that the spec¬ 
tral SNR is inversely proportional to the square root of 
the spectral resolving power and the square root of the 
fractional bandwidth, in contrast to the SNR on the cen¬ 
tral interferometric fringe which is directly proportional 
to the latter. 

OPD noise is not usually relevant for FTS systems, but 
is intrinsic to double-Fourier instruments, since the two 
incoming beams go through long separate paths before 
combination. For instruments on balloons or in space, 
the OPD noise is expected to be dominated by distur¬ 
bances from the instrument and from pointing errors. 
On average, OPD noise reduces the coherent power in 
the interferogram, and converts the power lost from the 
fringe into additional white noise in the spectrum. We 
argue that there are three relevant noise timescales: the 
time to take a single data point, the time to collect a 
complete interferogram, and the time to co-add M inter- 
ferograms together in a track. The latter corresponds to 
the timescale that the source spatial visibility function 


changes significantly, due to the rotation of the baseline 
angle on the sky. 

We derive the spectral sensitivity of double-Fourier in¬ 
struments to intensity and OPD noise. The expressions 
in this paper are derived in the general case and can 
be used to design any instrument that implements this 
method. 

Applied to the case of BETTII, these equations lead 
to spectral sensitivity estimates of 25 and 13 Jy in its 
30-55 /m i and 60-90 yLvn bands, respectively, to achieve 
a spectral SNR = 5 in 10 minutes with TZ = 10 and an 
assumed OPD noise of 5 /iiri rms. 
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Maryland. Work by T. Veach was supported by an ap¬ 
pointment to the NASA Postdoctoral Program at GSFC, 
administered by the Oak Ridge Associated Universities 
under contract with NASA. We would like to thank the 
anonymous referee for suggested improvements to the pa¬ 
per. 


APPENDIX 


A. DERIVING THE INTERFEROGRAM EQUATION IN A DOUBLE FOURIER SYSTEM 

The interferogram from a double-Fourier system is different from the interferogram for an FTS in several ways that 
derive from the fact that the double-Fourier system starts with two independent input beams viewing the same astro¬ 
nomical target. For this derivation, we will follow the convention in the FTS literature and consider the propagation 
of a single plane wave (radiation from a point source at infinity) at wavenumber a = 1/A through the system. 

Figure 2 in the main text shows the setup for a typical double-Fourier system with the K-mirror on one arm to keep 
the sky images at the same rotation on the two paths, and the delay line in the other arm to allow adjustment of the 
relative path lengths between path 1 and 2. The plane wave travels a distance x\ on path 1 from an entrance aperture 
an arbitrary distance above the siderostat to the beam combiner: ai(a)e^ 2 ' KlaXl+ ^ , where a\ is the amplitude of the 
electric field and <f> corresponds to an arbitrary phase offset. For convenience of notation, in the following derivation 
we drop the amplitudes’ dependence on wavenumber by writing a\ instead of ai(<r). 

The wave also undergoes phase shifts caused by reflections and partial reflections along the path. A full reflection 
for light traveling in air or a vacuum causes a 180 deg phase shift; a 50% reflectio n at the beam splitter/combiner 
causes a 90 deg phase shift between reflected and transmitted beam (lLawso 1|200(J. Since the instrument measures 
the combined light at the detectors, what matters is the difference in the numbers of reflections along path 1 and 2. 
In the case of the particular BETTII implementation, path 1 contains one more reflection than path 2. 

The electrical fields arriving at the “+” and detectors are then: 


A = aie ~ 27TicrXl +* 7r + i7r / 2 + < ^ _j_ a2e~ 2 ' KiaX2+ ^ > 

^ _ ai g — 2iri<7Xi+iv+<j> _|_ a ^ e ~2-Kiax 2 +iiT/2+c/> 

where the tt phase shift on path 1 occurs because there is one extra reflection compared to path 2 (see Fig. H, and (f> 
corresponds to an arbitrary phase offset. The detectors are power detectors so defining the intensity I = A*A: 


(Al) 

(A2) 


I_ = a\ + a\ + aia 2 ^ e - 27riCT(xi " X2)+3i7r/2 + e 2 ™(*i-®a)-3*7r/2 
I + = a\ + a 2 2 + oio 2 fe- 2nia{xi - X2)+i7T/2 + e 2 ™^ 


(A3) 

(A4) 


Defining x = X \ — Xi and expanding the complex exponentials, the equations can be simplified to: 


I- — {al + al) ^1 

I + = {a 2 + a 2 ) (l 


‘2d\CL2 
a\ + o| 


sin( 27 T(Ta:) 


2 oia 2 . . 

H—5- t ; sm(27rcra; 

CL l + G&2 


(A5) 

(A6) 
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where x is now the difference in the physical length between the two light paths. For the case of equal wave amplitudes 
on path 1 and 2 (ai = a 2 = a): 

I± = 2a 2 (l ± sin(27rcra;)). (A7) 


The generalization of this equation to a source distribution on the sky requires the recognition that a\ and ci 2 are 
complex values such that |ai| 2 (cr) and |a 2 | 2 (cr) are power from the source at wavenumber cx, while a\ a 2 is the correlated 
power seen through the two apertures which is the source spatial visibility, q(B, a), and is in general a complex valued 
function. 7 (B, a), which is a function of the baseline vector B connecting the two light collectors, and <r, is the Fourier 
transform of the source emission distribution on the sky. For the general case, the previous equations become: 

/_ = | ai | 2 + |a 2 | 2 + 7 (B, CT )g—2™(*1 —Z2)+3i7T/2 + 7 *( Bi CT ) e 27( A8 ) 
1+ = 1 0 , 1 1 2 + 1 02 1 2 + 7 (B, ff )e- 2 ™(^i-^)+»/2 + 7 *( Bj ^ e 2,rte(* 1 - IJ )-i 1 r/S i ( A 9) 


The same simplification as before can be done except that 7(B,ct) is a complex-valued function, 
normalized spatial visibility as 


Vb(ct) 


2b(B, a ) 
a\+a 2 2 ’ 


If we define the 
(A10) 


then the equation for I± becomes: 


I ± = (|«i| 2 + 1021 2 ) [1 ± (Re (Vb(ct)) sin(27TCTa:) — Im (Vb(ct)) cos(27rcra:))] , (All) 

i± = (|ai| 2 + |a 2 | 2 ) [l ± Re (iV B (cr)e _2 ’ ri<TX )] , (A12) 

where Re(/) is the real component of / and Im(/) is the imaginary component. 

The same style of derivation can be done with for a realistic instrument with a complex transfer function. If we 
characterize the spectral transmission function as t\(a) = |fi(cr)|e I$l( 7) along path 1, and ^(c) = |^ 2 (o') |e z<&2 on 
path 2, then the amplitude mismatch of the spectral transmission function in each path reduces the power in the 
interferogram and the phase differences introduce a phase factor <17 = $1 — $2 into the exponential term. As a 
result, the source visibility in the previous equations is multiplied by a normalized, instrumental visibility loss term, 

I± = (M 2 M 2 + M 2 |a 2 | 2 ) [l±Re(*V B (a)V»e- 2 ^*)] . (A13) 


B. SPECTRAL NOISE IN PRESENCE OF GAUSSIAN PHASE NOISE 

Suppose that the signal is a line of power density 2 S centered on bin number k corresponding to wavenumber ak¬ 
in the complex interferogram, the line has a power density S in bin fc and —S at —k, and zero everywhere else. To 
simplify the analysis, let’s focus on the positive frequencies, which only contain half the noise. The interferogram at 
delay x n = ndx is I k {x n ) = SSae~ 2 ™ akXn . Through a simple DFT, the value of the line in the spectrum in ideal 
conditions is: 


N / 2-1 IV/2-1 

dxT>¥T{l k {x n ))[k']=dx s6ae- 2i ™ kX "e 2innk '/ N = dx Y SSae~ 2i ^ k - k ' )n/N , (Bl) 

n =— N /2 n =— N /2 

which is equal to dxNSSa = S for k = k’ and zero everywhere else. Note that we have akX n = kSandx = kn/N. and 
Sa = (Ndx ) -1 . The phase noise degrades the effective power of the line, so it is now l Se _A «'/ 2 (lRichardsll2003f) . The 
noisy interferogram is T k (x n ) = S5ae~ 2l ^ kn ^ N e 1 ^ A x »). 

Designating the operator () as the ensemble average, the noise in the interferogram is the variance of the DFT: 


A|[fc'] = VAR(da:DFT(T fc (j: n ))[fc']) 


= dx 


Y Ik ( xn ) 


0 2 iTtnk ' /N 


YMxn)e 2 ™ nk ‘ /N 

n / 

= 1dx2 f E E e 2 ^”-">'/iv -YY, ( Ik (*")> e 2 ^"-”')*7^ ; 

\ n n ' n n ' / 

= dx2 YY m(x n )T* k (x n ,)) - (l k (x n )) (l* k (x n 0)] e 2 -("-"')*=7 N, 


(B2) 

(B3) 

(B4) 

(B5) 


We can write (X k (x n )Il(x n >)) = (S 2 Sa 2 e~ 2ln ^ n ~ n ) fc / Jv e 7 ,J>r 7 n)-$r(x n /))^ This quantity is equal to 

when n ^ n' and equal to S 2 Sa 2 when n = n'. The quantity (X k (x n ))(Xl(x n ')) is equal 
to S 2 5a 2 e~ 2l ™ ( ' n - n ') k / N e~ A n for all n and n!. Hence, the term in the sum is nonzero only for n = n’ , for which it is 
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S 2 Sa 2 (l — e A *). 


The value of the sum is then: 


dx 2 ^2 S 2 Sa 2 (l — e A *), 

(B6) 

dx 2 NS 2 6cr 2 (l — e -A *), 

(B7) 


(B8) 


This quantity is independent of k !, so the noise is white. The negative frequencies contribute the same amount, 
doubling the noise variance. However, we are only considering the imaginary part of the spectrum, so only half the 
noise variance is important in our calculation of our SNR. The last expression thus represents the variance of the noise 
that is useful for our SNR calculations. 


C. FRINGE TRACKING IN THE SCIENCE CHANNELS 

For sufficiently bright sources, it is possible to self-calibrate the OPD between subsets of the M interferograms in 
a track, to prevent the drift of an indirect OPD estimator. The idea is to bin consecutive interferograms in subsets 
in order to build up enough SNR to clearly see a fringe and be able to estimate its position with sufficient accuracy. 
Then, the different subsets within a track can be offset and co-added with better accuracy (smaller OPD noise) than if 
we were co-adding the M interferogram individually with only the instrument OPD estimator noise. The best scenario 
would be when the fringe has a high SNR in each single interferogram - which will be the case of calibrators for 
BETTII. 

There are many ways to fit the location of the fringe center, and the error associated with each meth od is h ighly 
imple mentation-specific. Here, we consider the simple example of a fringe tracking algorithm in two steps (jRizzo et al.l 
I2012H : a Hilbert transform of the interferogram to obtain its envelope; and a centroid of the points of the envelope 
above a certain SNRx threshold. The Hilbert transform doubles the error variance in the interferogram, and in 
the worst case, the centroid has an error variance of approximately (n x SNR^) -1 , where n is the number of data 
points above the threshold SNRx- The conversion to a phase leads to a phase error variance equal to [A| (cr)] direct ~ 
2 x (27t) 2 a 2 / <Tq /(n x SNRx). This indicates that when the SNR is high enough, this direct estimate of the phase can 
become better than the estimate coming from an indirect OPD estimator with corresponding phase error variance 
[A| (o’)] indirect, like the attitude estimator used on BETTII. 

In Figured! we use Ea. 1191 and a total phase error variance which is a combination of the phase noise from the direct 
and indirect methods, to ensure continuity: 

a2 / 1 1 

UA|(a)] direct + [A|((j)]indirect 

On BETTII, the bulk of the phase noise comes from the uncertainties in co-adding consecutive scans (timescale 3), 
as the estimator uses an indirect method and never really measures the absolute phase for low-SNR targets. For 
high-SNR targets, the method described above can serve as a fringe tracker that not only is useful for calibration, but 
can also substantially improve the phase estimator’s stability over long periods of time by preventing drifts. 
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Parameter 

Band 1 

Band 2 

Comment 

Window emissivity 

0.02 

0.02 

Measured in the lab 

Telescope emissivity 

0.077 

0.077 

10 mirrors at 0.992 reflection 

Sky radiance 

0.16 W.m li! .sr 1 

0.07 W.m' i .sr 1 

Harries 11980') 

Window radiance 

0.17 W.m -2 .sr —1 

0.04 W.m Asr 1 

Blackbody at 240 K 

Telescope radiance 

0.17 W.m -2 .sr —1 

0.04 W.m - Asr -1 

Blackbody at 240 K 

Total optical efficiency 

0.3 

0.3 

Per arm, includes detectors 

Photon power from the sky 

35 pW 

36 pW 


Photon power from the window 

40 pW 

18 pW 


Photon power from the telescope 

108 pW 

38 pW 


Total absorbed photon NEP 

1.4xl0 -15 W.Hz -0,5 

7xl0 -16 W.Hz- u - 5 

From each arm 

Detector NEP 

3xl0 -16 W.Hz-°- 5 

3xl0 -16 W.Hz- u - 5 

For each detector 

Total NEP 

2xl0 -15 W.Hz -0 5 

lxlO -15 W.Hz- u - 5 

For the sum of both arms 


TABLE 1 

BETTII NOISE PARAMETERS 


Single scan 


Band 1 

Band 2 

SNR Target 

MDFD 

15 Jy 

26 Jy 

SNRx = 1 

Normal observing (200 scans, 

10 min) 

MDFD 

i Jy 

2 Jy 

SNRx = 1 

Faintest pt. source 

25 Jy 

13 Jy 

SNRfc = 5 

Enhanced sensitivity (200 scans 

10 min) 

Faintest pt. source 

14 Jy 

7 Jy 

SNR fe = 5 


TABLE 2 

BETTII SENSITIVITY ESTIMATES 



Fig. 1.— Concept of wide-field double-Fourier interferometry. Light from the instrument is focused after combination to an image of the 
sky on the detector array (represented as the grid). Each column of the detector has a distinct ZPD so the interferometric responses (right 
side) of two sources on different columns are centered around different delay positions. The gray stripe represents the central column on 
the detector array and its corresponding ZPD on the interferograms. 
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Fig. 2.— Optical train diagram of a typical far-IR, double-Fourier instrument. The K-mirror rotates the beam to align the fields of view 
of the two sides. Inside the cryostat, a set of optics re-image the pupil, implement a controlled instrumental delay between them with the 
Delay Line, and relay them towards the central be am combiner. A f ter the combiner, the beams are imaged onto the detectors. To see the 
BETTII-specific implementation of this design, see I Rinehart etaLl (12014D . 



Fig. 3.— Effects of phase and intensity noise on the recovered spectrum (single realization of the noise). Left column: normalized 
interferograms, intensity as function of OPD. Right column: normalized DFT of interferograms. Solid: input spectrum multiplied by 
anti-symmetric transmission function; Solid circles: Imaginary part of DFT from interferogram; Dotted: Real part of DFT. First row: ideal 
measured signal, no noise; used for normalization of all other plots. Second row: results with a realization of phase noise of 10% at each 
point of the interferogram. Third row: results with a realization of intensity noise and SNRx = 10. 
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Fig. 4. — BETTII’s spectral sensitivity. Solid: Normal observing mode, band 1; Dashed: Enhanced sensitivity mode, band 1; Dotted: 
normal observing mode, band 2; Dot-dashed: Enhanced sensitivity mode, band 2. This plot includes the technique of fringe tracking in 
the science channel for sufficiently bright sources (see Appendix C). As the source flux rises, the effects of the phase noise become larger 
and the SNR should reach an asymptotic value. However, with fringe tracking, the phase noise itself becomes smaller since one can see 
fringes in one single or a few consecutive scans, so the co-adding becomes easier. Thanks to the fringe-tracking, there is no regime where 
the phase noise is expected to be dominant on BETTII, provided that the control system performs according to expectations. 





